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Abstract 

We derive a simple efficient algorithm for Abelian periods knowing all Abelian 
squares in a string. An efficient algorithm for the latter problem was given by 
Cummings and Smyth in 1997. By the way we show an alternative algorithm 
for Abelian squares. We also obtain a linear time algorithm finding all "long" 
Abelian periods. The aim of the paper is a (new) reduction of the problem of all 
Abelian periods to that of (already solved) all Abelian squares which provides 
new insight into both connected problems. 
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1. Introduction 

We present an efficient reduction of the Abelian period problem to the 
Abelian square problem. For a string of length n the latter problem was solved 
in 0(n 2 ) by Cummings and Smyth (jj. The best previously known algorithms 
for the Abelian periods, see [12|, worked in 0(n 2 m) time (where m is the al- 
phabet size) which for large m is 0(n 3 ). Our algorithm works in 0(n 2 ) time, 
independently of the alphabet size. As a by-product we obtain an alternative 
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0(n 2 ) time algorithm finding all Abelian squares and an 0{n) time algorithm 
finding a compact representation of all Abelian periods of length greater than 
n/2, in particular, the shortest such period. 

Abelian squares were first studied by Erdos jTlj, who posed a question on 
the smallest alphabet size for which there exists an infinite Abelian-square- 
free string. An example of such a string over five-letter alphabet was given by 
Pleasants [l6j and afterwards the best possible example over four-letter alphabet 
was shown by Keranen (l3j . 

Juite recently there have been several results on Abelian complexity in words 
, 0, [13 and partial words 0, Q and on Abelian pattern matching jB], 
[lj,ll5|. Abelian periods were first defined and studied by Constantinescu and 
Hie 

We say that two strings are (commutatively) equivalent, and write x = y, if 
one can be obtained from the other by permuting its symbols. In other words, 
the Parikh vectors V(x), V(y) are equal, where the Parikh vector gives frequency 
of each symbol of the alphabet in a given string. Parikh vectors were introduced 
already in Q for this problem. 

A string w is an Abelian k-power if w — X\X% . . . Xk, where 



x\ = x 2 



Xk 



The size of x\ is called the base of the fc-power. In particular w is an Abelian 
square if and only if it is an Abelian 2-power. 

A string x is an Abelian factor of y if V(x) < V(y), that is, each element of 
V(x) is smaller than the corresponding element of V{y). The pair (i,p) is an 
Abelian period of w = w[l, n] if and only if w[i + 1, j] is an Abelian fc-power with 
basep(for some k) andw[l,i] and w [j + 1 , n] are Abelian factors of w[i + l, i+p], 
see Fig. [TJ Here p is called the length of the period. 
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Figure 1: A word of length 25 with an Abelian period (i = 3,p = 6). This period implies 
two Abelian squares: abacbbbaabcb and baabcbbabcab. 

In Section [5] we introduce two auxiliary tables that we use in computing 
Abelian squares and powers. Next in Section [3] we show new 0(n 2 ) time algo- 
rithms for all Abelian squares and all Abelian periods in a string and a reduction 
between these problems. 

Finally in Section U we present an 0(n) time algorithm finding a compact 
representation of all "long" Abelian periods. Define 

MinLong{i) = min{p > n/2 : {i,p) is an Abelian period of w}. 

If no such p exists, we set MinLong(i) — oo. All long Abelian periods are of the 
form (i,p) where p > MinLong(i), the table MinLong is a compact 0{n) space 
representation of potentially quadratic set of long Abelian periods. 
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2. Auxiliary tables 

Let wbea string of length n. Assume its positions are numbered from 1 to n, 
w = W1W2 ■ ■ ■ w n . By w[i, j] we denote the factor of w of the form WiWi+i . . .Wj. 
Factors of the form w[l, i] are called prefixes of w and factors of the form w[i, n] 
are called suffixes of w. 

We introduce the following table: 

head(i,j) = minimum fc such that V(w[i,j\) < V(w\j + l,j + fc]). 

If no such k exists, we set head(i,j) = 00, and if j < i, we set head(i,j) = 0. In 
the algorithm below we actually compute a slightly modified table head '(i, j) = 
j + head(i,j). 

Example 1. For the infinite Fibonacci word T = abaababaabaababaababaa . . . 
the first several values of the table head(l,i) are: 



i 


1 


2 


3 


4 


5 


6 


7 


8 


9 


10 


11 






a 


b 


a 


a 


b 


a 


b 


a 


a 


b 


a 




head(l, i) 


2 


3 


3 


5 


5 


6 


8 


8 


10 


10 


11 





We have here Abelian square prefixes of lengths 6, 10, 12, 16, 20, 22. 

We show how to compute the head' table in 0(n 2 ) time. The computation 
is performed in row-order of the table using the fact that it is non-decreasing: 

Observation 2. For any 1 < i < j < n, head < head'(i,j + 1). 

We assume that the alphabet of w is S = {1, 2, . . . , m} where m < n. For a 
Parikh vector Q, by Q[i] for i = 1, 2, . . . , m we denote the number of occurrences 
of the letter i. For two Parikh vectors Q and R, we define their Parikh difference, 
denoted as Q — R, as a Parikh vector: (Q — R)[i] = Q[i] — R[i]. 

In the algorithm we store the difference Aj = V{yj)—V{xj) of Parikh vectors 

of 

Xj = w[i,j] and y 3 = w[j + 1, fc] 

where k = head Note that Aj[a] > for any a = 1,2, . . . ,m. 

Assume we have computed head'(i,j — 1) and Aj_i. When we proceed to 
j, we move the letter w[j] from y to x and update A accordingly. Thus at most 
one element of A might have dropped below 0. If there is no such element, we 
conclude that head = head — 1) and that we have obtained Aj = A. 
Otherwise we keep extending y to the right with new letters and updating A 
until all its elements become non-negative. We obtain the following algorithm 
Compute- head. 

Lemma 3. The head table can be computed in 0(n 2 ) time. 

Proof. The time complexity of the algorithm Compute- head is 0(n 2 ). Indeed, 
the total number of steps of the while-loop for a fixed value of i is 0(n), since 
each step increases the variable fc. □ 

We also use the following tail table that is analogical to the head table: 

tail(i,j) — minimum k such that V(w[i,j]) < V(w[i — k,i — 1]). 
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Algorithm Compute- head (w) 
for i := 1 to n do 

A:= (0,0,. ..,0); 

A[iw[i]] := 1; {Boundary condition} 

k := i; 

for j := i to n do 

A[w\j}] := A[w[j}} - 2; 
while (k < n) and (A[w[j]] < 0) do 
k := k + 1; 

A[w[fc]] := A[w[fc]] + 1; 
if A[w[j]] < then k := oo; 

head'(i,j) := k; head(i,j) := head'(i,j) — j; 



3. Abelian squares and Abelian periods 

In this section we show how Abelian periods can be inferred from Abelian 
squares in a string. 

Define by maxpower(i,p) the maximal size of a prefix of w[i, n] which is an 
Abelian fc-power with base p (for some k). Define square(i,p) — 1 if and only 
if maxpower(i,p) > 2p. Cummings and Smyth compute an alternative table 
square' '(i,p), such that square 1 {i,p) = 1 if and only if w[i — p + 1, i + p] is an 
Abelian square. These tables are clearly equivalent: 



The maxpower(i,p) table can be computed from the square(i,p) table in linear 
time using a simple dynamic programming recurrence: 



An alternative 0(n 2 ) time algorithm for computing the table square(i,p) for 
a string w of length n is a consequence of the following observation, see also 
Example Q] 

Observation 4. square(i,p) = 1 <^> head(i,i +p — 1) = p. 

Theorem 5. All Abelian squares in a string of length n can be computed in 
0(n 2 ) time. 

The following observation provides a constant-time condition for checking 
an Abelian period. 



square(i,p) =1 square' {i + p — l,p) 



1. 




(1) 
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Observation 6. (i,p) is an Abelian period of w if and only if 

p > head(l,i), tail(j, n) 

where j = i + 1 + maxpower(i + l,p). 

We conclude with the following algorithm for computing Abelian periods. In 
the algorithm we use our alternative version of computing the table square from 
head, since the latter table is computed anyway (instead of that Cummings and 
Smyth's algorithm can be used for Abelian squares). 



Algorithm Compute- Abelian-Periods 

Compute head(i,j), tail(i,j) using algorithm Compute-/iea<i; 
Initialize the table maxpower to zero table; 
for p := 1 to n do 

for i := n downto 1 do 
if i <n — p + 1 then 
maxpower (i,p) := p; 
if head(i, i + p — 1) = p then 

maxpower (i,p) := p + maxpower(i + p,p); 
for i := to n — 1 do 
for p := 1 to n — i do 

j := i + 1 + maxpower(i + l,p); 
if (p > head(l,i)) and (p > tail(j,n)) then 
Report an Abelian period (i,p); 



Theorem 7. All Abelian periods of a string of length n can be computed in 
0(n 2 ) time. 

4. Long Abelian periods 



In this section we show how to compute the table MinLong(i), see the ex- 
ample in the table below. 



i 
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4 


5 


6 


7 


8 


9 


10 


11 


12 


13 


w[i] 




c 


a 


a 


b 


b 


c 


a 


b 


b 


c 


a 


a 


a 


MinLong(i) 


7 


7 


9 


8 


7 


7 


oo 


oo 


oo 


oo 


oo 


oo 


oo 


oo 



For a non-decreasing function / : {1, 2, . . . , n + 1} — > {— oo}L){l, 2, . . . , n + 1} 
define the function 

/OO = min{j : f(j) > i}. 
If the minimum is undefined then we set f(i) = oo. 
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Observation 8. Let f be a function non- decreasing and computable in constant 
time. Then all the values of f can be computed in linear time. 

Theorem 9. A compact representation of all long Abelian periods can be com- 
puted in linear time. 

Proof. Let us take f{j) = j — tail(j,n). This function is non-decreasing, see 
also Observation [5] Then for i < % we have: 



MinLong{i) — max | —J +1, head(l,i), f(i) 
and otherwise MinLong(i) = oo, see also Fig.[5J 



i-1 



p = j - i - 1 




to[l,i] w\i + X,j — 1] uj[j, n] 

<r > 

tail(j, n) 

< > 

head(l, i) 

Figure 2: A schematic view of a long Abelian period: p > l|, p > head(l, i), tail(j, n). 

Hence the computation of MinLong table is reduced to linear time algorithm 
for / and the conclusion of the theorem follows from Observation |8] □ 
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